The shortest common supersequence problem in a microarray production setting
نویسنده
چکیده
MOTIVATION During microarray production, several thousands of oligonucleotides (short DNA sequences) are synthesized in parallel, one nucleotide at a time. We are interested in finding the shortest possible nucleotide deposition sequence to synthesize all oligos in order to reduce production time and increase oligo quality. Thus we study the shortest common super-sequence problem of several thousand short strings over a four-letter alphabet. RESULTS We present a statistical analysis of the basic ALPHABET-LEFTMOST approximation algorithm, and propose several practical heuristics to reduce the length of the super-sequence. Our results show that it is hard to beat ALPHABET-LEFTMOST in the microarray production setting by more than 2 characters, but these savings can improve overall oligo quality by more than four percent. AVAILABILITY Source code in C may be obtained by contacting the author, or from http://oligos.molgen.mpg.de.
منابع مشابه
Problems Related to Subsequences and Supersequences
We present an algorithm for building the automaton that searches for all non-overlapping occurrences of each subsequence from the set of subsequences. Further, we define Directed Acyclic Supersequence Graph and use it to solve the generalized Shortest Common Supersequence problem, the Longest Common Non-Supersequence problem, and the Longest Consistent Supersequence problem.
متن کاملSolving Shortest Common Supersequence Problem Using Artificial Bee Colony Algorithm
The idea behind this work is to solve the Shortest Common Supersequence by using Artificial Bee Colony (ABC) Algorithm. This algorithm is considered as one of the newest nature-inspired swarm-based optimization algorithms and has a promising performance [6]. Shortest Common Supersequence is a classical problem in the field of strings and it is classified as NP-Hard problem [12]. Many algorithms...
متن کاملShop Cells
In this paper the Group Technology problem of grouping parts in families and machines in cells in order to convert a job shop production system to some pseudo ow shops is considered. Several heuristic methods for solving Group Technology problem have been proposed in the literature, in general, not producing a ow shop in each cell|unless a great number of extra machines is added. We use known t...
متن کاملR O M a Group Technology with Flow Shop Cells
In this paper the Group Technology problem of grouping parts in families and machines in cells in order to convert a job shop production system to some pseudo ow shops is considered Several heuristic methods for solving Group Technology problem have been proposed in the literature in general not producing a ow shop in each cell unless a great number of extra machines is added We use known techn...
متن کاملMinimum cost multi-product flow lines
In this paper, the problem of finding the minimum cost flow line able to produce different products is considered. This problem can be formulated as a shortest path problem on an acyclic di-graph when the machines graph associated with each product family is a chain or a comb. These graphs are relevant in production planning when dealing with pipelined assembly systems. We solve the problem usi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 19 Suppl 2 شماره
صفحات -
تاریخ انتشار 2003